Learning Natural Language: A review of formal and computational approaches

Authors

  • Nancy Chang
  • Umesh Vazirani
Abstract

1 Overview

Over the last several decades, interaction between the fields of computational learning theory and linguistics has produced a sizable body of research. The question of how people learn to understand and use language cannot be answered without first considering what a language is, and what it means to learn something, let alone understand it. The formalization of the notion of language learning has been central to much work in inductive inference, including both algorithms for learning particular language classes under a variety of conditions and proofs that such algorithms do not exist. These theoretical investigations of learnability have, in turn, proven useful in linguistic theory for characterizing potential constraints on natural languages. Beyond such cross-fertilization, an independent area of research has sprung up at the intersection of the two fields to focus on the precise theoretical assumptions needed in particular linguistic frameworks to ensure that learning can take place.

This paper presents a brief summary of the major work relevant to formal and computational approaches to learning natural languages. The vast majority of research so far has been directed at the acquisition of syntax as captured by formal grammars. We thus review early theoretical work on identification of formal languages in general (Section 2) before describing subsequent work in which specific linguistic assumptions render the problem more tractable (Section 3).

A number of problems with such assumptions, however, have led to some alternative recent approaches. In particular, data-driven methods have tried to demonstrate that no language-specific assumptions are necessary, especially in a probabilistic framework. Challenges have also arisen in response to the focus on syntax and the paradigm of language learning as identification; semantic information not only simplifies the learning problem but is also more appropriate to the non-binary nature of language use and understanding.

2 Formalities

Grammatical inference as a theoretical field of study began with Gold's (1967) attempt to formalize the acquisition of natural language. We assume the now-standard definition of a language as the set of (finite) strings generated by a set of production rules, or grammar, based on the symbols of a given alphabet. That is, knowledge of a language can be identified with knowledge of the grammar that generates it, and learning a language given example strings (or sentences) is a process of inductive inference of such a grammar. The nature of the grammar determines where in the Chomsky hierarchy of complexity the language …
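
The definitions above can be made concrete with a small sketch. It is not taken from the paper: the three toy grammars, the names `language`, `learn`, and `HYPOTHESES`, and the length bound `max_len` are all assumptions chosen for illustration. The learner is a simple "first consistent hypothesis" enumeration over a finite, hand-ordered class of grammars, using positive example strings only. The ordering places smaller languages before the language that contains them, which this kind of learner relies on.

```python
from collections import deque

# Hypothetical toy grammars over the alphabet {a, b}. Each grammar maps a
# nonterminal to its right-hand sides (tuples of symbols). The order of the
# list is the learner's enumeration: subset languages precede supersets.
HYPOTHESES = [
    ("anbn", {"S": [(), ("a", "S", "b")]}),            # a^n b^n
    ("a_star", {"S": [(), ("a", "S")]}),               # a*
    ("ab_star", {"S": [(), ("a", "S"), ("b", "S")]}),  # (a|b)*
]


def language(grammar, max_len, start="S"):
    """Return every terminal string of length <= max_len derivable from start.

    Pruning on the number of terminals is enough to terminate for the toy
    grammars here, where every recursive rule adds at least one terminal.
    """
    seen, found = {(start,)}, set()
    queue = deque([(start,)])
    while queue:
        form = queue.popleft()
        # Index of the leftmost nonterminal, or None for a finished string.
        idx = next((i for i, s in enumerate(form) if s in grammar), None)
        if idx is None:
            found.add("".join(form))
            continue
        for rhs in grammar[form[idx]]:
            new = form[:idx] + rhs + form[idx + 1:]
            terminals = sum(1 for s in new if s not in grammar)
            if terminals <= max_len and new not in seen:
                seen.add(new)
                queue.append(new)
    return found


def learn(examples, hypotheses, max_len=6):
    """Yield the learner's current guess after each positive example."""
    langs = [(name, language(g, max_len)) for name, g in hypotheses]
    seen = set()
    for s in examples:
        seen.add(s)
        # First hypothesis in the enumeration consistent with all data so far.
        yield next((name for name, lang in langs if seen <= lang), None)


if __name__ == "__main__":
    print(sorted(language(HYPOTHESES[0][1], 4), key=len))
    print(list(learn(["a", "aa", "ab"], HYPOTHESES)))
```

On the example stream `a`, `aa`, `ab`, this learner guesses `a_star` twice and then switches to `ab_star` once it sees a string that `a_star` cannot generate. If `ab_star` were placed first in the enumeration, the learner would never leave it, which is one way to see why positive-only data makes the choice of hypothesis class and ordering matter.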

Similar resources

Prediction of Iranian EFL Learners’ Learning Approaches Through Their Teachers’ Narrative Intelligence and Teaching Styles: A Structural Equation Modelling Analysis

It goes without saying that there are many influential factors affecting the success of any learning experience, and teachers are definitely among the significant factors influencing the process of teaching and learning. In this respect, the present study sought to investigate the prediction of Iranian English as a Foreign Language (EFL) learners' learning approaches through their teachers’ nar...

The Effect of Dynamic Assessment of Language Learning: A Review of Literature

Researchers have historically noted the importance of Dynamic Assessment and its effect on students’ language learning. DA offers teachers and learners vast opportunities for language teaching and learning. The present article can be considered as part of the recent trend in the field of language teaching. It attempts to describe Dynamic Assessment and review the literature on the effect of DA ...

Grammar Inference, Automata Induction, and Language

The natural language learning problem has attracted the attention of researchers for several decades. Computational and formal models of language acquisition have provided some preliminary, yet promising insights of how children learn the language of their community. Further, these formal models also provide an operational framework for the numerous practical applications of language learning. ...

Grammar Inference, Automata Induction, and Language Acquisition

The natural language learning problem has attracted the attention of researchers for several decades. Computational and formal models of language acquisition have provided some preliminary, yet promising insights of how children learn the language of their community. Further, these formal models also provide an operational framework for the numerous practical applications of language learning. We w...

Book Review: "Literature and Language Learning in the EFL Classroom"

Literature and Language Learning in the EFL Classroom consists of nineteen chapters. The chapters of the book have been arranged into two parts: Part I, current issues and suggestions for new approaches (Chapters 1-6) and Part II, empirical and case studies (Chapters 7-19). The book takes multiple approaches to examine how literary texts can be incorporated into teaching practices in an EFL classro...

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

The DNA sequence, which contains all genetic traits, is not itself a functional entity. Instead, it is converted into protein sequences through the processes of transcription and translation. Each protein sequence later takes on a 3D structure, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

Publication date: 2007